Use of Meta-data for Value-level Inconsistency Detection and Resolution During Data Integration

نویسنده

  • Philipp Anokhin
چکیده

This paper addresses the data integration problem: there exists a collection of autonomous heterogeneous information sources that need to be integrated; users want to be able to query the collection transparently and to get a single, unambiguous answer. The sources may conflict with each other on three levels: their schemas, data representation, or data themselves. One has to resolve the conflicts that may arise during the integration to get the single answer to a query. Most of the approaches in this area of research resolve inconsistencies among different schemas and data representations, and ignore the possibility of data value-level conflict altogether. The few that do acknowledge its existence are mostly probabilistic approaches which just detect the conflict and provide a user with some additional information on the nature of the inconsistency (e.g. give a set of conflicting values with attached probabilities). We propose an extension to the relational data model that makes use of meta-data of the information sources called properties. This extension gives ground to a flexible data integration technique described in this paper that consists of three phases: (1) query result construction, (2) data conflict detection and (3) data conflict resolution. An improvement to data clustering techniques in the conflict detection phase is presented in the paper. It uses another type of meta-data available from the sources (source descriptions in terms of the virtual database schema) to narrow down the areas of possible data conflicts. For the last phase, a flexible data-level conflict resolution algorithm is offered, which incorporates both contentand property-based approaches. The algorithm is guided by user-defined priorities of properties and by domain-based resolution strategies.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integration of Visible Image and LIDAR Altimetric Data for Semi-Automatic Detection and Measuring the Boundari of Features

This paper presents a new method for detecting the features using LiDAR data and visible images. The proposed features detection algorithm has the lowest dependency on region and the type of sensor used for imaging, and about any input LiDAR and image data, including visible bands (red, green and blue) with high spatial resolution, identify features with acceptable accuracy. In the proposed app...

متن کامل

Data Integration: Inconsistency Detection and Resolution Based on Source Properties

This paper addresses the problem of integration of multiple heterogeneous information sources. The sources may conflict with each other on the following three levels: their schema, data representation, or data themselves. Most of the approaches in this area of research resolve inconsistencies among different schemas and data representations, and ignore the possibility of data-level conflict alt...

متن کامل

Statistical downscaling of GRACE gravity satellite-derived groundwater level data

With the continued threat from climate change, population growth and followed by increasing water demand, the need for hydrological data with high spatial resolution and proper time coverage to be felt more than ago. Therefore, having data such as terrestrial water storage changes and groundwater level changes with high resolution spatial helps to plan and make decisions for water resource mana...

متن کامل

Diagnostic Value of Urinary Neutrophil Gelatinase-Associated Lipocalin (NGAL) in Detection of Pediatric Acute Kidney Injury; a Systematic Review and Meta-Analysis

Background: Two questions about diagnostic value of urinary neutrophil gelatin associated lipocalin (uNGAL) in detection of acute kidney injury (AKI) in children have remained unanswered; first, which cut-off point of uNGAL has the highest value in detection of AKI; and second when is the best time for measuring this biomarker in a patient? Accordingly, the present study aimed to conduct a syst...

متن کامل

Integration of Deep Learning Algorithms and Bilateral Filters with the Purpose of Building Extraction from Mono Optical Aerial Imagery

The problem of extracting the building from mono optical aerial imagery with high spatial resolution is always considered as an important challenge to prepare the maps. The goal of the current research is to take advantage of the semantic segmentation of mono optical aerial imagery to extract the building which is realized based on the combination of deep convolutional neural networks (DCNN) an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005